智能论文笔记

Advances in Prediction of Readmission Rates Using Long Term Short Term Memory Networks on Healthcare Insurance Data

Shuja Khalid , Francisco Matos , Ayman Abunimer , Joel Bartlett , Richard Duszak , Michal Horny , Judy Gichoya , Imon Banerjee , Hari Trivedi

分类：机器学习 | 人工智能

2022-06-30

30天的医院再入院是一个长期存在的医疗问题，会影响患者的发病率和死亡率，每年造成数十亿美元的损失。最近，已经创建了机器学习模型来预测特定疾病患者的住院再入院风险，但是不存在任何模型来预测所有患者的风险。我们开发了一个双向长期记忆（LSTM）网络，该网络能够使用随时可用的保险数据（住院访问，门诊就诊和药物处方）来预测任何入院患者的30天重新入选，无论其原因如何。使用历史，住院和入院后数据时，表现最佳模型的ROC AUC为0.763（0.011）。 LSTM模型显着优于基线随机森林分类器，表明了解事件的顺序对于模型预测很重要。与仅住院数据相比，与住院数据相比，将30天的历史数据纳入也显着改善了模型性能，这表明患者入院前的临床病史，包括门诊就诊和药房数据是重新入院的重要贡献者。我们的结果表明，机器学习模型能够使用结构化保险计费数据以合理的准确性来预测住院再入院的风险。由于可以从网站中提取计费数据或同等代理人，因此可以部署此类模型以识别有入院风险的患者，或者分配更多可靠的随访（更近的后续后续，家庭健康，邮寄药物） - 出院后风险患者。

translated by 谷歌翻译

MedShift: identifying shift data for medical dataset curation

Xiaoyuan Guo , Judy Wawira Gichoya , Hari Trivedi , Saptarshi Purkayastha , Imon Banerjee

分类：计算机视觉

2021-12-27

为了策划高质量的数据集，识别内部和外部来源之间的数据方差是一个基本和关键的步骤。但是，尚未显着研究检测数据移位或差异的方法。对此的挑战是缺乏学习DataSet的密集代表和在医疗机构分享私人数据的困难的有效方法。为了克服这些问题，我们提出了一个统一的管道，称为MedShift以检测顶级移位样本，从而促进医疗策序。给定内部数据集A作为基础源，我们首先为每类数据集A列车以以无人监督的方式学习内部分布。其次，在不交换跨源的情况下，我们在每个类的外部数据集b上运行训练的异常检测器。具有高异常分数的数据样本被识别为移位数据。为了量化外部数据集的换档，我们将B的数据基于所获得的分数群集分组。然后，我们通过逐渐删除每个类的最大异常分数来测量B的多级分类器并测量与分类器的性能方差的班次。此外，我们还调整数据集质量指标，以帮助检查多个医疗源的分布差异。我们验证了来自肌肉骨骼射线照片（Mura）和胸部X射线数据集的MedShift的疗效，来自多个外部源。实验表明我们所提出的移位数据检测管道对医疗中心有益，以更有效地策划高质量的数据集。一个接口介绍视频，可视化我们的结果可在https://youtu.be/v3bf0p1sxqe上获得。

translated by 谷歌翻译

Two-step adversarial debiasing with partial learning -- medical image case-studies

Ramon Correa , Jiwoong Jason Jeong , Bhavik Patel , Hari Trivedi , Judy W. Gichoya , Imon Banerjee

分类：计算机视觉 | 机器学习

2021-11-16

在过去几年中，在医疗保健中使用人工智能（AI）已成为一个非常活跃的研究领域。虽然在图像分类任务中取得了重大进展，但实际上只能部署一些AI方法。目前积极使用临床AI模型的主要障碍是这些模型的可信度。这些复杂模型更常见，是一种黑色盒子，其中产生了有希望的结果。然而，当仔细检查时，这些模型开始在决策期间揭示隐式偏差，例如检测种族并对民族群体和群体具有偏见。在我们正在进行的研究中，我们开发了一个两步的逆势脱叠方法，部分学习可以减少种族差异，同时保留目标任务的性能。该方法已经在两个独立的医学图像案例研究 - 胸X射线和乳房X光检查中进行了评估，并在保持目标性能的同时表现出偏差减少的承诺。

translated by 谷歌翻译

Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions

Harsh Trivedi , Niranjan Balasubramanian , Tushar Khot , Ashish Sabharwal

分类：自然语言处理

2022-12-20

Recent work has shown that large language models are capable of generating natural language reasoning steps or Chains-of-Thoughts (CoT) to answer a multi-step question when prompted to do so. This is insufficient, however, when the necessary knowledge is not available or up-to-date within a model's parameters. A straightforward approach to address this is to retrieve text from an external knowledge source using the question as a query and prepend it as context to the model's input. This, however, is also insufficient for multi-step QA where \textit{what to retrieve} depends on \textit{what has already been derived}. To address this issue we propose IRCoT, a new approach that interleaves retrieval with CoT for multi-step QA, guiding the retrieval with CoT and in turn using retrieved results to improve CoT. Our experiments with GPT3 show substantial improvements in retrieval (up to 22 points) and downstream QA (up to 16 points) over the baselines on four datasets: HotpotQA, 2WikiMultihopQA, MuSiQue, and IIRC. Notably, our method also works well for much smaller models such as T5-Flan-large (0.7B) without any additional training.

translated by 谷歌翻译

Online Convex Optimization of Programmable Quantum Computers to Simulate Time-Varying Quantum Channels

Hari Hara Suthan Chittoor , Osvaldo Simeone , Leonardo Banchi , Stefano Pirandola

分类：人工智能 | 机器学习

2022-12-09

Simulating quantum channels is a fundamental primitive in quantum computing, since quantum channels define general (trace-preserving) quantum operations. An arbitrary quantum channel cannot be exactly simulated using a finite-dimensional programmable quantum processor, making it important to develop optimal approximate simulation techniques. In this paper, we study the challenging setting in which the channel to be simulated varies adversarially with time. We propose the use of matrix exponentiated gradient descent (MEGD), an online convex optimization method, and analytically show that it achieves a sublinear regret in time. Through experiments, we validate the main results for time-varying dephasing channels using a programmable generalized teleportation processor.

translated by 谷歌翻译

DroneAttention: Sparse Weighted Temporal Attention for Drone-Camera Based Activity Recognition

Santosh Kumar Yadav , Achleshwar Luthra , Esha Pahwa , Kamlesh Tiwari , Heena Rathore , Hari Mohan Pandey , Peter Corcoran

分类：计算机视觉

2022-12-07

Human activity recognition (HAR) using drone-mounted cameras has attracted considerable interest from the computer vision research community in recent years. A robust and efficient HAR system has a pivotal role in fields like video surveillance, crowd behavior analysis, sports analysis, and human-computer interaction. What makes it challenging are the complex poses, understanding different viewpoints, and the environmental scenarios where the action is taking place. To address such complexities, in this paper, we propose a novel Sparse Weighted Temporal Attention (SWTA) module to utilize sparsely sampled video frames for obtaining global weighted temporal attention. The proposed SWTA is comprised of two parts. First, temporal segment network that sparsely samples a given set of frames. Second, weighted temporal attention, which incorporates a fusion of attention maps derived from optical flow, with raw RGB images. This is followed by a basenet network, which comprises a convolutional neural network (CNN) module along with fully connected layers that provide us with activity recognition. The SWTA network can be used as a plug-in module to the existing deep CNN architectures, for optimizing them to learn temporal information by eliminating the need for a separate temporal stream. It has been evaluated on three publicly available benchmark datasets, namely Okutama, MOD20, and Drone-Action. The proposed model has received an accuracy of 72.76%, 92.56%, and 78.86% on the respective datasets thereby surpassing the previous state-of-the-art performances by a margin of 25.26%, 18.56%, and 2.94%, respectively.

translated by 谷歌翻译

Machine Learning for Smart and Energy-Efficient Buildings

Hari Prasanna Das , Yu-Wen Lin , Utkarsha Agwan , Lucas Spangher , Alex Devonport , Yu Yang , Jan Drgona , Adrian Chong , Stefano Schiavon , Costas J. Spanos

分类：机器学习

2022-11-27

Energy consumption in buildings, both residential and commercial, accounts for approximately 40% of all energy usage in the U.S., and similar numbers are being reported from countries around the world. This significant amount of energy is used to maintain a comfortable, secure, and productive environment for the occupants. So, it is crucial that the energy consumption in buildings must be optimized, all the while maintaining satisfactory levels of occupant comfort, health, and safety. Recently, Machine Learning has been proven to be an invaluable tool in deriving important insights from data and optimizing various systems. In this work, we review the ways in which machine learning has been leveraged to make buildings smart and energy-efficient. For the convenience of readers, we provide a brief introduction of several machine learning paradigms and the components and functioning of each smart building system we cover. Finally, we discuss challenges faced while implementing machine learning algorithms in smart buildings and provide future avenues for research at the intersection of smart buildings and machine learning.

translated by 谷歌翻译

Bounding Box Priors for Cell Detection with Point Annotations

Hari Om Aggrawal , Dipam Goswami , Vinti Agarwal

分类：计算机视觉

2022-11-11

The size of an individual cell type, such as a red blood cell, does not vary much among humans. We use this knowledge as a prior for classifying and detecting cells in images with only a few ground truth bounding box annotations, while most of the cells are annotated with points. This setting leads to weakly semi-supervised learning. We propose replacing points with either stochastic (ST) boxes or bounding box predictions during the training process. The proposed "mean-IOU" ST box maximizes the overlap with all the boxes belonging to the sample space with a class-specific approximated prior probability distribution of bounding boxes. Our method trains with both box- and point-labelled images in conjunction, unlike the existing methods, which train first with box- and then point-labelled images. In the most challenging setting, when only 5% images are box-labelled, quantitative experiments on a urine dataset show that our one-stage method outperforms two-stage methods by 5.56 mAP. Furthermore, we suggest an approach that partially answers "how many box-labelled annotations are necessary?" before training a machine learning model.

translated by 谷歌翻译

SWTF: Sparse Weighted Temporal Fusion for Drone-Based Activity Recognition

Santosh Kumar Yadav , Esha Pahwa , Achleshwar Luthra , Kamlesh Tiwari , Hari Mohan Pandey , Peter Corcoran

分类：计算机视觉

2022-11-10

Drone-camera based human activity recognition (HAR) has received significant attention from the computer vision research community in the past few years. A robust and efficient HAR system has a pivotal role in fields like video surveillance, crowd behavior analysis, sports analysis, and human-computer interaction. What makes it challenging are the complex poses, understanding different viewpoints, and the environmental scenarios where the action is taking place. To address such complexities, in this paper, we propose a novel Sparse Weighted Temporal Fusion (SWTF) module to utilize sparsely sampled video frames for obtaining global weighted temporal fusion outcome. The proposed SWTF is divided into two components. First, a temporal segment network that sparsely samples a given set of frames. Second, weighted temporal fusion, that incorporates a fusion of feature maps derived from optical flow, with raw RGB images. This is followed by base-network, which comprises a convolutional neural network module along with fully connected layers that provide us with activity recognition. The SWTF network can be used as a plug-in module to the existing deep CNN architectures, for optimizing them to learn temporal information by eliminating the need for a separate temporal stream. It has been evaluated on three publicly available benchmark datasets, namely Okutama, MOD20, and Drone-Action. The proposed model has received an accuracy of 72.76%, 92.56%, and 78.86% on the respective datasets thereby surpassing the previous state-of-the-art performances by a significant margin.

translated by 谷歌翻译

Predictive Scale-Bridging Simulations through Active Learning

Satish Karra , Mohamed Mehana , Nicholas Lubbers , Yu Chen , Abdourahmane Diaw , Javier E. Santos , Aleksandra Pachalieva , Robert S. Pavel , Jeffrey R. Haack , Michael McKerns

分类：机器学习 | 人工智能 | (统计)机器学习

2022-09-20

在整个计算科学中，越来越需要利用原始计算马力的持续改进，通过对蛮力的尺度锻炼的尺度增加，以增加网状元素数量的增加。例如，如果不考虑分子水平的相互作用，就不可能对纳米多孔介质的转运进行定量预测，即从紧密的页岩地层提取至关重要的碳氢化合物。同样，惯性限制融合模拟依赖于数值扩散来模拟分子效应，例如非本地转运和混合，而无需真正考虑分子相互作用。考虑到这两个不同的应用程序，我们开发了一种新颖的功能，该功能使用主动学习方法来优化局部细尺度模拟的使用来告知粗尺度流体动力学。我们的方法解决了三个挑战：预测连续性粗尺度轨迹，以推测执行新的精细分子动力学计算，动态地更新细度计算中的粗尺度，并量化神经网络模型中的不确定性。

translated by 谷歌翻译